Search CORE

15 research outputs found

How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets

Author: Apidianaki Marianna
Chatzikyriakidis Stergios
Talman Aarne
Tiedemann Jörg
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2022
Field of study

Peer reviewe

arXiv.org e-Print Archive

Helsingin yliopiston digitaalinen arkisto

How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets

Author: Apidianaki Marianna
Chatzikyriakidis Stergios
Talman Aarne
Tiedemann Jörg
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2022
Field of study

Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Testing the Generalization Power of Neural Network Models Across NLI Benchmarks

Author: Chatzikyriakidis Stergios
Talman Aarne Johannes
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2019
Field of study

Neural network models have been very successful in natural language inference, with the best models reaching 90% accuracy in some benchmarks. However, the success of these models turns out to be largely benchmark specific. We show that models trained on a natural language inference dataset drawn from one benchmark fail to perform well in others, even if the notion of inference assumed in these benchmarks is the same or similar. We train six high performing neural network models on different datasets and show that each one of these has problems of generalizing when we replace the original test set with a test set taken from another corpus designed for the same task. In light of these results, we argue that most of the current neural network models are not able to generalize well in the task of natural language inference. We find that using large pre-trained language models helps with transfer learning when the datasets are similar enough. Our results also highlight that the current NLI datasets do not cover the different nuances of inference extensively enough.Peer reviewe

arXiv.org e-Print Archive

Crossref

Helsingin yliopiston digitaalinen arkisto

Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging

Author: Celikkanat Hande
Heinonen Markus
Talman Aarne
Tiedemann Jörg
Virpioja Sami
Publication venue
Publication date: 10/04/2023
Field of study

This paper introduces Bayesian uncertainty modeling using Stochastic Weight Averaging-Gaussian (SWAG) in Natural Language Understanding (NLU) tasks. We apply the approach to standard tasks in natural language inference (NLI) and demonstrate the effectiveness of the method in terms of prediction accuracy and correlation with human annotation disagreements. We argue that the uncertainty representations in SWAG better reflect subjective interpretation and the natural variation that is also present in human language understanding. The results reveal the importance of uncertainty modeling, an often neglected aspect of neural language modeling, in NLU tasks.Comment: NoDaLiDa 2023 camera read

arXiv.org e-Print Archive

Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations

Author: Celikkanat Hande
Kakouros Sofoklis
Suni Antti
Talman Aarne
Tiedemann Jörg
Vainio Martti
Publication venue: 'Linkoping University Electronic Press'
Publication date: 06/08/2019
Field of study

In this paper we introduce a new natural language processing dataset and benchmark for predicting prosodic prominence from written text. To our knowledge this will be the largest publicly available dataset with prosodic labels. We describe the dataset construction and the resulting benchmark dataset in detail and train a number of different models ranging from feature-based classifiers to neural network systems for the prediction of discretized prosodic prominence. We show that pre-trained contextualized word representations from BERT outperform the other models even with less than 10% of the training data. Finally we discuss the dataset in light of the results and point to future research and plans for further improving both the dataset and methods of predicting prosodic prominence from text. The dataset and the code for the models are publicly available.Peer reviewe

arXiv.org e-Print Archive

Helsingin yliopiston digitaalinen arkisto

Sentence Embeddings in NLI with Iterative Refinement Encoders

Author: Talman Aarne Johannes
Tiedemann Jörg
Yli-Jyrä Anssi
Publication venue
Publication date: 03/06/2019
Field of study

Sentence-level representations are necessary for various NLP tasks. Recurrent neural networks have proven to be very effective in learning distributed representations and can be trained efficiently on natural language inference tasks. We build on top of one such model and propose a hierarchy of BiLSTM and max pooling layers that implements an iterative refinement strategy and yields state of the art results on the SciTail dataset as well as strong results for SNLI and MultiNLI. We can show that the sentence embeddings learned in this way can be utilized in a wide variety of transfer learning tasks, outperforming InferSent on 7 out of 10 and SkipThought on 8 out of 9 SentEval sentence embedding evaluation tasks. Furthermore, our model beats the InferSent model in 8 out of 10 recently published SentEval probing tasks designed to evaluate sentence embeddings' ability to capture some of the important linguistic properties of sentences.Peer reviewe

arXiv.org e-Print Archive

Helsingin yliopiston digitaalinen arkisto

The University of Helsinki submissions to the WMT19 news translation task

Author: Hurskainen Arvi
Raganato Alessandro
Scherrer Yves
Sulubacak Umut
Talman Aarne
Tiedemann Jörg
Vazquez Raul
Virpioja Sami
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2019
Field of study

In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.Peer reviewe

arXiv.org e-Print Archive

Crossref

Helsingin yliopiston digitaalinen arkisto

Archivio della ricerca- Università di Roma La Sapienza